Skip to content

Conversation

muhammad-tanvir-1211
Copy link

This PR fixes the calls to BlockDim* and GridDim* through SYCL. The current changes give incorrect output if you run a CUDA kernel through SYCL on NVIDIA A100.

Co-authored-by: Mehdi Goli <[email protected]>
Copy link

@aacostadiaz aacostadiaz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for fixing this

@aacostadiaz aacostadiaz merged commit 6a8aee6 into intel:sycl-develop Apr 29, 2024
AD2605 pushed a commit to AD2605/cutlass-fork that referenced this pull request May 24, 2024
Fix the calls to BlockDim* and GridDim* through SYCL. The current changes give incorrect output if you run a CUDA kernel through SYCL on NVIDIA A100.

Co-authored-by: Mehdi Goli <[email protected]>

---------

Co-authored-by: Mehdi Goli <[email protected]>
aacostadiaz pushed a commit that referenced this pull request Jul 16, 2024
Fix the calls to BlockDim* and GridDim* through SYCL. The current changes give incorrect output if you run a CUDA kernel through SYCL on NVIDIA A100.

Co-authored-by: Mehdi Goli <[email protected]>

---------

Co-authored-by: Mehdi Goli <[email protected]>
aacostadiaz pushed a commit to aacostadiaz/cutlass-fork that referenced this pull request Aug 6, 2024
Fix the calls to BlockDim* and GridDim* through SYCL. The current changes give incorrect output if you run a CUDA kernel through SYCL on NVIDIA A100.

Co-authored-by: Mehdi Goli <[email protected]>

---------

Co-authored-by: Mehdi Goli <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants